Corpus: som_wikipedia_2021_10K

Other corpora

5.2.18 Words nearly always together in sentences

Strong sentence co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/together_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency together Qoutient
Qaramada Midoobay 45 44 41 1.18
Midoobay Qaramada 44 45 41 1.18
Addis Ababa 11 9 9 1.22
Ababa Addis 9 11 9 1.22
Aires Buenos 9 9 8 1.27
Awal Habr 9 8 7 1.47
Buenos Aires 9 9 8 1.27
Habr Awal 8 9 7 1.47
Radhi Jama 7 5 5 1.40
Sap Tonle 7 6 6 1.17
dhaban lambar 7 7 6 1.36
dhaban xigana 7 7 6 1.36
dhaban Lambarkan 7 6 6 1.17
lambar dhaban 7 7 6 1.36
lambar xigana 7 7 6 1.36
lambar Lambarkan 7 6 6 1.17
xigana dhaban 7 7 6 1.36
xigana lambar 7 7 6 1.36
xigana Lambarkan 7 6 6 1.17
Hutu Tutsi 6 6 5 1.44
254 msec needed at 2021-06-23 12:01